AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
High-fidelity Audio

# High-fidelity Audio

Csm 1b
Apache-2.0
CSM (Conversational Speech Model) is a 1B-parameter speech generation model developed by Sesame, capable of generating RVQ audio encoding from text and audio inputs.
Speech Synthesis English
C
unsloth
2,667
5
Csm 1b
Apache-2.0
A PyTorch-based text-to-speech model supporting Chinese speech synthesis, developed and released by SesameAILabs.
Speech Synthesis
C
nielsr
18
3
Sepformer Dns4 16k Enhancement
Apache-2.0
This is a speech enhancement model based on the SepFormer architecture, specifically designed for denoising tasks. It was trained on the Microsoft DNS-4 dataset and supports audio processing at a 16kHz sampling rate.
Audio Enhancement Supports Multiple Languages
S
speechbrain
1,669
20
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase